#lightweight language models28/04/2025
Tina: USC's Tiny Models Deliver Big Advances in Cost-Effective Reinforcement Learning
USC researchers introduce Tina, a family of compact reasoning models that leverage LoRA and reinforcement learning to deliver strong multi-step reasoning performance at a fraction of typical training costs.